NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

ARN: Analogical Reasoning on Narratives

Sourati, Zhivar; Ilievski, Filip; Sommerauer, Pia; Jiang, Yifan (May 2024, Transactions of the Association for Computational Linguistics)

Full Text Available
PAIR Diffusion: A Comprehensive Multimodal Object-Level Image Editor

https://doi.org/10.1109/CVPR52733.2024.00822

Goel, Vidit; Peruzzo, Elia; Jiang, Yifan; Xu, Dejia; Xu, Xingqian; Sebe, Nicu; Darrell, Trevor; Wang, Zhangyang; Shi, Humphrey (June 2024, IEEE)

Full Text Available
Transferring Procedural Knowledge Across Commonsense Tasks

Jiang, Yifan; Ilievski, Filip; Ma, Kaixin (September 2023, IOS Press Ebooks)

Full Text Available
In-Context Learning Unlocked for Diffusion Models

Wang, Zhendong; Jiang, Yifan; Lu, Yadong; Shen, Yelong; He, Pengcheng; Chen, Weizhu; Wang, Zhangyang; Zhou, Mingyuan (December 2023, Neural Information Processing Systems)

We present Prompt Diffusion, a framework for enabling in-context learning in diffusion-based generative models. Given a pair of task-specific example images, such as depth from/to image and scribble from/to image, and a text guidance, our model automatically understands the underlying task and performs the same task on a new query image following the text guidance. To achieve this, we propose a vision-language prompt that can model a wide range of vision-language tasks and a diffusion model that takes it as input. The diffusion model is trained jointly on six different tasks using these prompts. The resulting Prompt Diffusion model becomes the first diffusion-based vision-language foundation model capable of in-context learning. It demonstrates high-quality in-context generation for the trained tasks and effectively generalizes to new, unseen vision tasks using their respective prompts. Our model also shows compelling text-guided image editing results. Our framework aims to facilitate research into in-context learning for computer vision. We share our code and pre-trained models at https://github. com/Zhendong-Wang/Prompt-Diffusion.
more » « less
Full Text Available
Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models

Wang, Zhendong; Jiang, Yifan; Zheng, Huangjie; Wang, Peihao; He, Pengcheng; Wang, Zhangyang; Chen, Weizhu; Zhou, Mingyuan (December 2023, Neural Information Processing Systems)

Diffusion models are powerful, but they require a lot of time and data to train. We propose Patch Diffusion, a generic patch-wise training framework, to significantly reduce the training time costs while improving data efficiency, which thus helps democratize diffusion model training to broader users. At the core of our innovations is a new conditional score function at the patch level, where the patch location in the original image is included as additional coordinate channels, while the patch size is randomized and diversified throughout training to encode the cross-region dependency at multiple scales. Sampling with our method is as easy as in the original diffusion model. Through Patch Diffusion, we could achieve ≥2× faster training, while maintaining comparable or better generation quality. Patch Diffusion meanwhile improves the performance of diffusion models trained on relatively small datasets, e.g., as few as 5,000 images to train from scratch. We achieve outstanding FID scores in line with state-of-the-art benchmarks: 1.77 on CelebA-64×64, 1.93 on AFHQv2-Wild-64×64, and 2.72 on ImageNet-256×256. We share our code and pre-trained models in GitHub.
more » « less
Full Text Available
Signal Processing for Implicit Neural Representations

Xu, Dejia; Wang, Peihao; Jiang, Yifan; Fan, Zhiwen; Wang, Zhangyang (November 2022, Advances in neural information processing systems)

Full Text Available
InstantNet: Automated Generation and Deployment of Instantaneously Switchable-Precision Networks

Fu, Yonggan; Yu, Zhongzhi; Zhang, Yongan; Jiang, Yifan; Li, Chaojian; Liang, Yongyuan; Jiang, Mingchao; Wang, Zhangyang; Lin, Yingyan (January 2021, The Design Automation Conference)
null (Ed.)
Full Text Available
Difference-Frequency Generation Terahertz Quantum Cascade Lasers with Surface Grating Outcouplers

https://doi.org/10.1364/CLEO_SI.2018.SF3G.7

Kim, Jae Hyun; Jung, Seungyong; Jiang, Yifan; Fujita, Kazuue; Hitaka, Masahiro; Ito, Akio; Edamura, Tadataka; Belkin, Mikhail A. (May 2018, Conference on Lasers and Electro-Optics, OSA Technical Digest (online))

We report terahertz quantum cascade laser sources based on intra-cavity difference-frequency generation processed into double-metal waveguides with surface-grating outcouplers. Over 112 μW of peak power output is produced at room temperature at 1.9 THz.
more » « less
Full Text Available
Double-metal waveguide terahertz difference-frequency generation quantum cascade lasers with surface grating outcouplers

https://doi.org/10.1063/1.5043095

Kim, Jae Hyun; Jung, Seungyong; Jiang, Yifan; Fujita, Kazuue; Hitaka, Masahiro; Ito, Akio; Edamura, Tadataka; Belkin, Mikhail A. (October 2018, Applied Physics Letters)

Search for: All records